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This paper is concerned with nonlinear filtering of the coefficients 
in asset price models with stochastic volatility. More specifically, we 
assume that the asset price process S = (St)t>o is given by 

dS t = m(0t)S t dt + v(6 t )S t dB t , 

where B = (Bt) t >o is a Brownian motion, v is a positive function 
and 6 — (9t)t>0 is a cadlag strong Markov process. The random pro- 
cess 9 is unobservable. We assume also that the asset price St is 
observed only at random times < n < T2 < ■ ■ ■ . This is an appro- 
priate assumption when modeling high frequency financial data (e.g., 
tick-by-tick stock prices). 

In the above setting the problem of estimation of 6 can be ap- 
proached as a special nonlinear filtering problem with measurements 
generated by a multivariate point process (jk , log S Tk ) • While quite 
natural, this problem does not fit into the "standard" diffusion or 
simple point process filtering frameworks and requires more techni- 
cal tools. We derive a closed form optimal recursive Bayesian filter 
for 8t, based on the observations of (jk, log S Tk )fe>i- It turns out that 
the filter is given by a recursive system that involves only determin- 
istic Kolmogorov-type equations, which should make the numerical 
implementation relatively easy. 

1. Introduction. In the classical Black-Scholes model for financial mar- 
kets, the stock price St is modeled as a geometric Brownian motion, that is, 
with diffusion coefficient equal to a St, where "volatility" a is assumed to be 
constant. The volatility parameter is of great importance in applications of 
the model, for example, for option pricing. Consequently, many researchers 
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have generalized the constant volatility model to so-called stochastic volatil- 
ity models, where at is itself random and time dependent. There are two 
basic classes of models: complete and incomplete. In complete models, the 
volatility is assumed to be a functional of the stock price; in incomplete 
models, it is driven by some other source of noise that is possibly correlated 
with the original Brownian motion. In this paper we study a particular 
incomplete model in which the volatility process is independent of the driv- 
ing Brownian motion process. This has the economic interpretation of the 
volatility being influenced by market, political, financial and other factors 
that are independent of the "systematic risk" (the Brownian motion pro- 
cess) associated with the particular stock price under study. Option traders, 
investment banks, economic analysts and others depend on modeling future 
volatility for their trading, economic forecasts, risk management and so on. 

Estimating volatility from observed stock prices is not a trivial task in 
either complete or incomplete models, in part because the prices are ob- 
served at discrete, possibly random time points. Since volatility itself is not 
observed, it is natural to apply filtering methods to estimate the volatility 
process from historical stock price observations. Nevertheless, this has only 
recently been investigated in continuous-time models, in particular, by Prey 
and Runggaldier [5]. See [22] for an up-to-date survey. See also [2] for a 
discrete-time approach with equally spaced observations, [6] for an approxi- 
mating algorithm in continuous time, [17] for a nonparametric approach, as 
well as [3, 10, 19] for still other approaches. There is also a rich economet- 
rics, time-series literature on ARCH-GARCH models of stochastic volatility, 
that presents an alternative way to model and estimate volatility; see [7] for 
a survey. 

Our paper was prompted by Prey and Runggaldier [5]. Like that paper, 
we assume that the asset price process S = (St)t>o is given by 

dS t = m(0t)Stdt + v(0 t )StdB t , 

where B = (Bt)t>o is a Brownian motion, v is a positive function, and 
9 = (6t)t>o is a cadlag strong Markov process. The "volatility" process 6 
is unobservable, while the asset price St is observed only at random times 
< T\ < T2 < ■ ■ ■ . This assumption is designed to reflect the discrete nature 
of high frequency financial data such as tick- by-tick stock prices. The ran- 
dom time moments can be interpreted as "instances at which a large trade 
occurs or at which a market maker updates his quotes in reaction to new 
information" (see [4]). Hence, it is natural to assume that {Tk}k>i might 
also be correlated with 0. 

In the above setting the problem of volatility estimation can be regarded 
as a special nonlinear filtering problem. 

Frey and Runggaldier [5] derive a Kallianpur-Striebel type formula (see, 
e.g., [9]) for the optimal mean-square filter for 0t based on the observations 
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of S T1 ) S T2 , . . . for all Tfe < t and investigate Markov chain approximations 
for this formula. We extend this result in that we derive the exact filtering 
equations for 9 t that allow us to compute the conditional distribution of 
9t given S^Ati <Sr 2 At, • • • • Moreover, our framework includes general random 
times of observations, not just doubly stochastic Poisson processes. 

We remark that, while being natural, the Frey and Runggaldier model 
adopted in this paper does not quite fit into the "standard" diffusion or 
simple point process filtering frameworks (cf. [12, 15, 20]) and requires more 
technical tools. In particular, the general filtering theory for diffusion pro- 
cesses requires that the diffusion coefficient of the observation process does 
not depend on the state process, while in our case the presence of 9t in 
the diffusion coefficient is crucial. The "standard" filtering theory for point 
processes is also not applicable in the present setting since the observation 
process (rj,5 ri )j>i is a multivariate process (see also Remark 2). 

It turns out that the resulting filtering equations are simpler than their 
counterparts in the case of continuous observations. In the latter case, the 
nonlinear filters are described by infinite-dimensional stochastic differential 
equations. For example, if 0t is a diffusion process, the filtering equations 
(e.g., Kushner filter or Zakai filter) are given by stochastic partial differential 
equations (see, e.g., [20]). In contrast, in our setting, the filtering equation 
can be reduced to a recursive system of linked deterministic equations of 
Kolmogorov type. Therefore, the numerical implementation of the filter is 
much simpler (see the follow up paper [1]). 

We describe the model in Section 2, state the main results and examples 
in Section 3, provide the proofs in Section 4, and present more detailed 
examples in Section 5. 

2. Mathematical model. 

2.1. Risky asset and observation times. Let us fix a probability space 
(fi, J 7 , P) equipped with a filtration F = (J 7 t)t>o that satisfies the "usual" 
conditions (see, e.g., [16]). All random processes considered in the paper are 
assumed to be defined on (f2,.F, P) and adapted to F. 

It is assumed that there is a risky asset with the price process S = (St)t>o 
given by the ltd equation 

(2.1) dS t = m{6 t )S t dt + v{9 t )S t dB u 

where B = (B t ) t >o is a standard Brownian motion and 9 = (9t)t>o is a cadlag 
Markov jump-diffusion process in K with the generator C To simplify the 
discussion, it is assumed that m{x) and v(x) are measurable bounded func- 
tions on R, the initial condition So is constant, and v(x) and So are positive. 

The process (#t)t>o is referred to as the volatility process. It is unob- 
servable, and the only observable quantities are the values of the log-price 
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process X t = log St taken at stopping times (T k )k>o, so that To = 0,Tfc < Tfc+i 
if r k < oo, and r k f oo as k f oo. 

In accordance with (2.1), the log-price process is given by 

X t = l\m{6 s )-\v 2 {6 s ))ds+ fv{6 s )dB s . 
Jo Jo 

For notational convenience, set X k := X Tk . Thus, the observations are given 
by the sequence (T k ,X k ) k > . 

Remark 1 (Note on the reading sequence). The reader interested pri- 
marily in applying our results to real data can focus her attention on Exam- 
ple 3.1, which appears to be the most practical model to work with. That 
example provides self-contained formulas for estimating the conditional (fil- 
tering) distribution of the volatility process. We report on the numerical 
results related to this example in the follow-up paper [1]. 

Clearly, the observation process (r k , X k )k>o is a multivariate (marked) 
point process (see, e.g., [8, 13]) with the counting measure 

H(dt,dy) = ^2l {Tk<oo} 5{ Tkt x k }{t,y)dtdy, 
k>l 

where o~{ Tkt x k } ls the Dirac delta- function on IR + x R. 

We introduce two filtrations related to (r k , X k ) k >Q\ {G{n)) n >o and (Gt)t>o, 
where 

g(n) :=o-{(T k ,X k ) k < n }, 

Gt:=<r(n([0,r] X T) : r < t,F e B(R)) , 

where B(M) is the Borel cr-algebra on M. 

It is a standard fact (see Theorem 31 in Chapter III, Section 3 in [8]) that 

(2.2) G Tk =G(k), A; = 0,1,..., 

and {r k } is a system of stopping times with respect to [Gt)t>o- 

Remark 2. Although G Tk contains all the relevant information carried 
by the observations obtained up to time r k , the filtration (Gt)t>o provides ad- 
ditional information between the observation times. To elucidate this point 
on a more intuitive level, we note that the length of the time elapsed be- 
tween r k and T k+ \ carries additional information about the state of Q% after 
Tfc. Specifically, if the frequency of observations is proportional to the stock's 
volatility v(9t), t £}T k ,T k+ i{, the larger values of t — T k might indicate lower 
values of v(9t). 
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2.2. Volatility process. A more precise description of the volatility pro- 
cess is in order now. Let (R,S(R)) and (R+ x R,B(R+) <g) B(R)) be mea- 
surable spaces with Borel a-algebras. The volatility process 9 = (0t)t>o is 
defined by the Ito equation 

(2.3) d9 t = b(t,9 t )dt + a(t,9 t )dW t + f u(Q t -,x)(n e -v e )(dt,dx), 



where Wt is a standard Wiener process and p? = p e (dt, dx) is a Poisson mea- 
sure on (R + x R, B(R + ) <S> B(R)) with the compensator v B {dt, dx) = K(dx) dt, 
where K(dx) is a a-finite nonnegative measure on (R, 0(R)). We assume that 
E9q < oo, the functions b(t, z), a(t, z) and u(z,x) are Lipschitz continuous 
in z uniformly with respect to other variables, and 

\b(t,z)\ 2 + \a{t,z)\ 2 + f \u(z,x)\ 2 K(dx)<C(l + \z\ 2 ). 

JR 

It is well known that under these assumptions (2.3) possesses a unique strong 
solution adapted to F, and E9 2 < oo for any t > 0. 
The generator £ of the volatility process is given by 

£f(x):=b(t,x)f'(x) + ±a 2 (t,x)f"(x) 

+ [ (f(x + u(x,y))-f(x)-f'(x)u(x,y))K(dy). 



Before proceeding with the assumptions and main results, we shall intro- 
duce additional notation. Set 

(2.4) a(s,t) = J\m{e u )-\v 2 {e u ))du 
and 

(2.5) cr 2 (s,t) = J\ 2 {6 u )du. 

For simplicity, it is assumed that v 2 (s,t) is bounded away from zero. Let us 
denote by p s ,t(y) the density function of the normal distribution with mean 
a(s,t) and the variance a 2 (s,t): 

(2.6) Ps M := 1 -(y-a(M))V(2^(M)). 

V 2vrcr(s, t) 

Clearly, p is the conditional density of the stock's log-increments Xt — X s 
given 9. 

Let = (Ft)t>o be the right-continuous filtration generated by (9t)t>o 
and augmented by P-zero sets from T . Denote by Gf, the conditional distri- 
bution of Tfc + i with respect to V Q{k) (here and below T 1 V T 2 stands 
for the u-algebra generated by the a-algebras T l and JT 2 ). That is, G d k is 
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the distribution of the time of the next observation, given previous history, 
and given 9, 

(2.7) G e k (dt) = P(T k+1 edt\Jtvg(k)). 

Without loss of generality, we can and will assume that G k (dt) is the regular 
version of the RHS of (2.7). 

Let N = (Nt)t>o be the counting process with interarrival times: tq = 0, 
(r k -Tfc_i)fc>i, that is, 

(2.8) N t = J2i(n<t). 

k>l 

2.3. Assumptions. The following assumptions will be in force throughout 
the paper: 

Assumption A.O. For every ^-predictable and a.s. finite stopping time 

S, 

P(N s -N S -^0\g S -) = or 1. 

Assumption A.l. The Brownian motion B is independent of (9,N). 

Assumption A. 2. For every k, there exists a C/(£;)-measurable inte- 
grable random measure & k on S(R+) so that for almost all w G O, $fc([0, T k (u)]) 
and G k is absolutely continuous with respect to 

Denote by cj)(r k ,t) = 4>(9,Tk,t) the Radon-Nikodym derivative of G e k (dt) 
with respect to & k (dt), that is, for almost every uj, 

(9 0^ M * dGj((r k ,t]) 

(2-9) 9{rk,t): --r^a — ttv- 

d$> k ({T k ,t\) 

Assumption A.O is not essential for the derivation of the filter. However, 
under this assumption, the structure of the optimal filter is simpler, and 
in the practical examples important for this paper, this assumption holds 
anyway. In particular, Assumption A.O is verified if the conditional distri- 
bution G k = P(T k+ i < VQ(k)) is absolutely continuous with respect to 
the Lebesgue measure or if the arrival times r k are nonrandom (more gener- 
ally, it holds if the compensator of the counting process Nt is a continuous 
process). 

The following two simple but important examples illustrate Assump- 
tion A. 2. 
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Example 2.1. Let (r k ) k >o be the jump times of a doubly stochastic 
Poisson process (Cox process) with the intensity n(9t). In this case, 



Then, one can take $ k (ds) = ds and cp(T k ,s) = n(9t) exp(— n(9 u ) du) . If 
n{0t) = n is a constant, one could also choose 

<& k (ds) = nexp{n(rfc — s)} ds and (f>(r k ,s) = l. 

Example 2.2. If the filtering is based on nonrandom observation times 
r k (e.g., Tfc = kh where h is a fixed time step), then a natural choice would 
be ^k(ds) = 6{ Tk+1 }(s) ds and (f>(r k ,s) = 1. 

For practical purposes, & k (ds) must be known or easily computable as 
soon as the observations (r^, Xi)i<k become available. In contrast, the Radon— 
Nikodym density (j>(rk) is, in general, a function of the volatility process and 
is subject to estimation. 

We note that Assumption A. 2 could be weakened slightly by replacing 
Gf. by a regular version of the conditional distribution of t^. + i with respect 
to T^ k+i _ V Q{k). The latter assumption would make the proof a little bit 
more involved and we leave it to the interested reader. 

3. Main results and introductory examples. 

3.1. Main result. For a measurable function / on R with E\f{9t)\ < oo, 
define the conditional expectation estimator nt(f) by 



where irt(dz) := dP(8t < z\Qt) is the filtering distribution. [Note that we omit 
the argument 9t of / in the estimator 7Tj(/).] In the spirit of the Bayesian 
approach, it is assumed that the a priori distribution 



Let cr{9 Tk } be the cr-algebra generated by 9 Tk . For t>T k , let us define the 
following structure functions: 



(3.2) Mf^y,9r k ):=E(f(9 t )p Tk , t (y-X k )<P(T k ,t)\a{8 Tk }vg(k)), 




t > T k , 

otherwise. 



(3.1) 




ir (dx) = P(9q G dx) 



is given. 



and its integral with respect to y 



(3.3) 




E(f(9 t ) ( p(T k: t)\a{9 Tk }vg(k)), 
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where p and <j> are given by (2.6) and (2.9), respectively. 
If / = 1, the argument / in tp and ip is replaced by 1. 
Write 

POO 

®k({Tk+l}) ■= / I(t = T k+1 )$ k (dt), 
JO 

that is, $ fc ({Tfc+i}) is the jump of $ k (dt) at r fc+1 . 
Finally, for t > and a bounded function /, define 



M k {f;t,TT t ) :-- 



-K Tk {i>k (/;*)) - n-(f)^Tk (^fc(i; *)) 



/~7r^ fc (i;0)**(<fr) 

whenever the numerator is not zero. If the numerator is zero, set J\4j-(f; t, ift) 
to be equal to zero. 

The main result of this paper is as follows: 

Theorem 3.1. Let Assumptions A.0-A.2 hold. Then for every mea- 
surable bounded function f in the domain of the generator C such that 
/q E\Cf{9 s ) \ ds < oo for any t>0, the following system of equations holds: 

(1) For every k = 0, 1, ... , 



m K Tk (j> k (f;t,y)) 
Tk+lU> 7r Tfe (^(l;t,y)) 



y=*k+i 



(3.4) 

-A4 fc (/;t,7r t )| {t=Tfc+l} $({r fe+ i}). 
(2) For every A; = 0,1,... and f G ]rfc, Tfe + i[, 

(3.5) d7T t (/) = 7T t (£/) (it - >Jfc(/; t, 7T t )*fc(dt). 

3.2. Remarks. 

1. Equations (3.4) and (3.5) form a closed system of equations for the 
filter 7Tf(/). It is often convenient and customary (see, e.g., [20, 21] and the 
references therein) to write a differential equation for a measure- valued pro- 
cess Ht(dx) in its variational form, that is, as the related system of equations 
for Ht(f) for all / from a sufficiently rich class of test functions belonging 
to the domain of the operator C In our setting, such a reduction to the 
variational form is a necessity, since in some cases the filtering measure 
TT s (dx) = P(0 S £ dx\Q s ) may not belong to the domain of C However, in the 
important examples discussed below, there is no need to resort to the vari- 
ational form. The interested reader who is unaccustomed to the variational 
approach might benefit from looking first into the examples at the end of 
this section and in Section 5, where the filtering equations are written as 
equations for posterior distributions. 
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2. The system (3.4) simplifies considerably if 



(3.6) 



Mk(f\t,T t )\{ 



t=T k + 1 



} $({r fc+1 }) = for all k. 



Obviously, (3.6) holds if, for all k,&k(dt) is continuous at t = Tk+i, as in 
the case when Nt is a Cox process. In fact, (3.6) holds true in many other 
interesting cases, even when $>k(dt) has jumps at all r^+i, as in the case of 
fixed observation intervals (see Example 5.3 below). We note then that the 
following separation principle holds. 

Corollary 1. Assume (3.6). Then the filtering at the observation times 
{ T k}k>i does not require filtering between them; it is done by the Bayes type 
recursion: 



3. Note that for high-frequency observations, even if condition (3.6) is 
not met, for all practical purposes, it may suffice to compute the volatility 
estimates only at the observation times. In that case, one would only use 
the relatively simple recursion formula (3.4), and disregard equation (3.5). 

4. Clearly, the "structure functions" tp and tp are of paramount impor- 
tance for computing the posterior distribution of the volatility process. We 
would like to stress that these do not involve the observations and could 
be pre-computed "off-line" using just the a priori distribution. Then, "on- 
line," when the observations become available, one needs only to plug in the 
obtained measurements (r^A^), and to compute TTt(f) by recursion. This 
feature is important for developing efficient numerical algorithms. 

5. Note also that, for almost every oj £ f2, filtering equation (3.5) is a 
linear deterministic equation of Kolmogorov's type, rather than a nonlinear 
stochastic partial differential equation. The latter is typical of the nonlinear 
filtering of diffusion processes. The well-posedness and the regularity prop- 
erties of equation (3.5) are well researched in the literature on second-order 
parabolic deterministic integro-differential equations (see, e.g., [11, 14, 18] 
and the references therein). 

Example 3.1 (Volatility as a Markov chain). Let us now assume that 
the counting process is a Cox process with intensity n(6t), and take 0(r&, s) = 

n(9t)e ^ and <&k{ds) = ds. Also assume 8 = (9 t )t<T is a homoge- 
neous Markov jump process taking values in the finite alphabet {a\, . . . , om} 
with the intensity matrix A = ||A(aj,Oj)|| and the initial distribution p q = 



(3.7) 



■^T k (tpk(f\t,y)) 
■K Tk (ip k (l;t,y)) | 



t=T k + l 

y=X k+1 



} 
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P(6*o = a q ) , q = 1, . . . , M. (This is one of the two models of the state process 
discussed in [5].) In this case, 

jC,f(e s ) = J2KGs,a j )f(a j ). 
j 

Denote by 9\ the process 9t starting from a,j, and 

Pji(t) := P(9 t = ai\6 = a,j), 7Tj(t) = P(9 t = aj\Q t ), 

r ri (t,z) :=E(e-Io n ^ d y ot (z)\9i= ai ), 

where P^ ot (z) is obtained by substituting 9{ for 9 S in p ot (z). It follows from 
Theorem 3.1 (for details, see Example 5.1), with f(9 t ) := Irg t - ai y, that 



(3.8) TTjfa) 



n ( a i) Ej r ji( T k ~ Tk-l,X k - X k _i)pji(T k - 7fc_]>j(7fc_i) 
J2i,j n ( a i) r ji( T k ~ Tk-l,X k - X k ^i)pji{T k - Tk-l)lTj(T k -l) 



This recursion can be easily computed, once one computes ("off-line") the 
values rij. This example is also treated in more detail in Section 5. 

4. Proofs. In the proof of the main result we want to show that 

d7T t (f)=7T t (Cf)dt + dM t , 

where Mt is a martingale, and then we find a (integral) martingale represen- 
tation of Mt with respect to the measure fi — u, where v is a compensator 
of /i. We first find the compensator. 

4.1. (Qt) -compensator of fj,. Denote by V(G) the predictable cr-algebra 
on Q x [0, oo) with respect to Q and set 

f>{G)=V{G)®B^L). 

A nonnegative random measure i>(dt, dy) onT'{Q) is called a V{Q) -compensa- 
tor of p if, for any 7 7 (^)-measurable, nonnegative function ip(t, y) = lp(oj, t, y) : 

(i) / / <p(s, y)u(ds, dy) is "P(£?) -measurable, 
Jo Jm 

(4.1) 

roo r roc r 

(ii) E / / <p(t,y)n(dt,dy)=E / / <p(t,y)v(dt,dy). 

Jo Jr Jo Jr. 
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Let Gk(ds,dx) = G k (io,ds,dx) be a regular version of the conditional distri- 
bution of (rk+i, Xk+i) given G(k) (it is assumed that G k ([0, r k ], dx) = 0): 

(4.2) G k (dt, dy) = dP(r k+1 < t, X k+1 < y\G{k)). 

Denote G k (dt) = G k {dt,R), that is, G k (t) = P(r k+1 < t\G(k)) (with proba- 
bility one). 

By Theorem III. 1.33 in [8] (see also Proposition 3.4.1 in [16]), 
(4.3, ^* ) = g Wt) _«^. 

We now derive a representation, suitable for the filtering purposes, of the 
^(^-compensator v in terms of the structure functions (3.2), (3.3) and the 
posterior distribution of 6. 

Lemma 4.1. The V (G) -compensator v admits the following version: 

(4.4) W=£WW r ^, W» 

Proof. By Assumption A.l, for t>r k , with probability 1, 
P(r fc+1 <t,A fc+1 <y|^vg(fc)) 

= E(P(r k+1 < t, X k+1 < y\T e V G(k) V cr(r fc+1 ))|^ V G(k)) 

(4.5) = E(I {Tk+l < t) P(X k+1 < y\rl V Gik) V a(r k+l ))\^L V G(k)) 

ry 





i<t) 


rt ry 






Pr k , 


J Tfr J — OO 





p Thi7h+1 {z-x k )dzj^ \fg(k) 



OO 



where we recall that Gf. is a regular version of the conditional distribution 
of r k+ i with respect to V G(k). Thus, by Assumption A. 2, for t > r k , 
with probability 1, 

P(T k+1 <t,X k+1 <y\J* vg(k)) 

(4.6) 

rt ry 



Pr k ,s( Z ~ X k )(p(T k ,s)dz$ k (ds). 

r fc J oo 

By (3.2), using notation (3.1), we see that 

i?(^[ < /)(r fc , S ) /0Tfc , s (z-A fc )| C T{^}vg(fc)]|g(A;))=7r rfc (^(l;s^)). 
This, together with (4.6), yields, recalling definition (4.2), 
(4.7) G k (ds,dz) =ir Tk (ifj k (l;s,z))$ k (ds)dz. 
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In the same way, for t> Tk, with probability 1, 

POO 

(4.8) G k ([t,oo],R) = J 7^(^(1; s))* fc (cfo). 

This completes the proof. □ 

Remark 3. If the right-hand side of (4.8) is zero, then 

Pfa+i>W0) = o- 

Hence, /j Tfc Tfc+i j (t) = with probability 1 and, by the 0/0 = convention, 
the corresponding term in (4.4) is zero. 

4.2. Semimartingale representation of the optimal filter. In this section 
we will prove the following result. 

Theorem 4.1. For any bounded function f from the domain of the 
operator C such that Jq E\Cf(9 s )\ ds < oo for all t < oo, the ltd differential 
of the optimal filter vr s (/) is given by equation 

d-Ks(f) =ir s (£f)ds 



x (M- v){ds,dy). 

Proof. It suffices to verify the statement for twice continuously differ- 
entiable functions / with /, /'/" bounded. By Ito's formula, 



m) = f(9o)+ f Cf{6 s )ds+ f f'{e s )a(9 s )dW s 
Jo Jo 



+ / / (f(0 a - + u(0,-,x)) - f(9 s -))(^ -v y )(ds,dx). 
Jo Jr 



Denote 



L t = [ f'{0 s )o-(6 s )dW s 
Jo 

+ f [ (f(O s - + u(0 s -,x))-f(e s -))(fi e -i, e )(ds,dx). 
Jo Jr 

Then, we have 

7r t (/) = E(f(e )\g t ) + E^cf(e s ) ds\g?j + E(L t \g t ). 
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Set 



M t = {E(f(e )\g t )-7r (f)} 

+ {E(J*£f(O s )ds\g t }- j\ s {Cf)ds\ + E{L t \g t ). 

Obviously, the process E(f(6o)\Gt) — tto(/) is a ^-martingale. Process L t is 
a ^-martingale. Since Gt^ Ft, for t>t', 

E{E{L t \g t )\g t ,) = E{E{L t \J : t>)\Gt')=E(L t ,\g t ,). 

Consequently, E(L t \gt) is a martingale too. Finally, ElJ^ £f(6 s ) ds\Gt) — 
/ 7r s ((£/)) ds is also a ^t-martingale. Indeed, for t > s > t', we have E(ir s (£f)\ 
g t ,) = E{Cf{6 a )\g v ), which yields 

g t 



E 



E 



£f(9 s )ds 
t 1 

o 



El / Cf{9 s )ds 



ir s (£f)ds g t > 

-I" 

Jo 



ir s (£f)ds. 



Thus, Mi is a C/j-martingale. In particular, this means that nt(f) is a ^-semi- 
martingale with paths in the Skorokhod space Dr 0iOO )(K), so that 7Tt{f) is a 
right continuous process with limits from the left. By the martingale rep- 
resentation theorem (see, e.g., Theorem 1 and Problem l.c in Chapter 4, 
Section 8 in [16]), 

ft 



M t 



H(s,y)(n - v)(ds,dy). 



It is a standard fact that P(N S - N S - / 0\Gs~) = v({S},R+). Hence, due to 
Assumption A.O, by Theorem 4.10.1 from [16] [see formulae (10.6) and (10.15)], 



(4.10) 



H(t,y) = Ml(AM\V(G))(t,y), 



where AM t = M t — M t _ and the conditional expectation M^(g\V(G)) is 
defined by the following relation (see, e.g., [16], Chapter 2, Section 2 and 
Chapter 10, Section 1): for any "P(£/)-measurable bounded and compactly 
supported function ip(t,y), 



E 



E 



<p(t,y)gtfJ>{dt,dy) 

ip(t,y)M p Jg\V(g))(t,y)v(dt,dy). 



By Lemma 4.10.2, [16], 

(4.11) Mj(7r t (/)|^))(<,») = Mj(/|W))(t,y). 
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Since vr t _(/) is V{Q) -measurable [which implies M^(ir_(f)\V(G))(t, y) = 
7r t _(/)],by (4.11), 

M*(AM\V(g))(t,y) 
(4-12) = U p ^ t {f)-^ t .{f)\V(Q)){t,y) 

= Ml(f\f{Q)){t,y)-n-{f). 
To complete the proof, one needs to show that 

,4,3) M W »»)-EWlW^ 

To prove (4.13), it suffices to demonstrate that, for any 7 3 (^)-measurable 
bounded and compactly supported function <p(t,y), 

£> J(Tk,T k+1 ]n(T k ,oo)JlBL 7r Tk ^ k (l;t,y)) 

(4.14) 

v ' poo r 

= E / <p(t,y)f(OMdt,dy). 



By monotone class arguments, we can assume that ip(t,x) = v(t)g(x), 
where v(t) is a "P(C?)-measurable process and g(x) is a continuous function 
on M. By Lemma III. 1.39 in [8], since v(t) is "P(<5)-measurable, it must be 
of the form 



CO 



(4-15) v(t) = v + Y,v k (t)I jTk>Tk+lj (t), 

k>l 

where vq is a constant and v k (t) are Q(k) <8> i3(K+) -measurable functions. 

Owing to (4.15) and Lemma 4.1, in order to prove (4.14), it suffices to 
verify the equality 



E 

(4.16) 



(T k ,T k+1 }n(T k ,oo)JR Tr Tk {ip k {l;t,y)) 



= E[v k (T k+1 )g(X k+1 )f(6 Tk+1 )l {Tk+1<oo} ]. 
The next step follows the ideas of Theorem III. 1.33 in [8]. We have 
E[v k (T k+l )g(X k+1 )f(6 Tk+1 )l {Tk+1<oo} ] 

= s[£;(^(r fc+1 ) 5 (x fc+1 )/(^ +1 )i {Tfc+1<oo} |g(fc)v^)] 



E[ / / v k (s)g(y)E[f(9 s )Gi(ds,dy)\g(k)] 
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where, as before, G e k {ds , dy) is the regular version of the conditional distri- 
bution of (rfc+i,Xfc+i) with respect to V Q{k). 
By Fubini's theorem, and recalling notation (4.2), 



E[ / / v k (s)g(y)E[f(9 s )Gl(ds,dy)\g(k)) 

'(Tfc,Oo) 



e{ I I v k (s)g(yY- 

\J(T k ,oo) JR 



E[f(e s )G e k (ds,dy)\g(k)] 



i(r k ,oo)M G k ([s, oo];R) 

G k (du,] 

[s,oo] 

E[f(9 s )Gl(ds,dy)\g(k)] \ 



(4.17) 



= E[ / v k {s)g{y) 

\Jr k Jr G k ([s,co\;R) J 

By (4.6), 

(4.18) G k (ds,dy) = p Tk)S (z - X k )cf>(T k ,s)<S> k {ds)dy. 

Hence, for s > r k , 
E[f{8 s )G e k {ds,dy)\Q(k)} 

= E(E(f(6 s )p Tk , s (y - X k )cj)(T k , s)\a{9 Tk } V g(k))\g(k))$ k (ds) dy. 
= K Tk {il) k {f]s,y))dy<$> k (ds). 
This, together with (4.8), yields 

\Jr k Jr G k ([s,oo\;R) J 

tp( f Tk+1 f l w ^ ^r k (Mf;s,y))dy \ 
= E[ / / v k (s)g(y) k - $ fc (ds) , 

\Jr k Jr J s ■K Tk {il){l;t))§ k {dt) J 

so that (4.16) is satisfied, and the proof follows. □ 

4.3. Proof of Theorem 3.1. In this section we show that Theorem 3.1 
follows from Lemma 4.1 and Theorem 4.1. 

Proof. First, we note that the stochastic integral in the RHS of (4.9) 
can be written as the difference of the integrals with respect to p and v. 
Indeed, since / is bounded, this follows from [8], Proposition II. 1.28. 

By applying Lemma 4.1 and integrating over y, one gets that, for t £ 
jr k ,T k+ ij, 

n Tk (ipk(f;s,y)) 



tit-, ^ )v{ds,dy) 

tf7^fc(/;*))-^-(/>7*(^fc(i;*)) 



(n,t] /~7r Tfc (^ fc (l;u))$ fe (du) 
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This equation verifies that (3.5) follows from the semimartingale repre- 
sentation (4.9), for t between the consecutive observation times. 
For the jump part (3.4), we note that 

/ / n s -(f)ii(ds,dy)= 7I Wi)-(/) 
Jo Jm. ^. 



and 



lo JM vr Tfc (^ fc (l;s,y)) 
Now, (4.9) can be rewritten as follows: 

n(f) = Mf)+ f^ s (Cf)ds 

Jo 

(4.19) 



r s=r k+1 

1 y=x k +i 



+ E 



E 

fc>0 



7Tr fc (^fc(i;s,y)) 

.M fc (/;s,7r s )$ fc (da). 



'(r fc ,tAr fe+1 ] 

Suppose t G ]r fc , r fc+1 [. Then, 

**(/)=*■■*(/) + f\ a {Cf)da- [* M k (f;s,x 8 )$ k (d8). 

J Tk J Th 



It follows that 

7r (r fe+1 )-(/) 



7Tr fc (/)+ r +1 7T s (Cf)ds 
Jr k 



i T k+l)' 



M k (f;s,ir s )$ k (ds) 



Therefore, from (4.19), 

^Tk + l \J ) 



-K Tk (ip k {l;s,y)) 



1 j/=X fc+ / 



A4fe(/;i,7r t )| {t=rfe+l} $({r fc+ i}). 



This completes the proof. □ 



5. Examples. In this section we consider some important special cases 
of Theorem 3.1. 



Example 5.1 {Markov chain volatility and Cox process arrivals). Recall 
the setting of Example 3.1 and its notation r^, Hj{t) and 9 3 . It follows from 
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Example 2.1 that in this case & k ({r k+ i}) = for all k. Hence, the second 
term in the RHS of (3.4) is zero. By (3.2), for f(0t) = l{0 t=a< } an d t > T k, 

^(/;t,y,^J = n(a0^(J {et=Oi} e-/>^% S;t (y-x)|^)] 

X—X k 

Thus, owing to the homogeneity of 9t, for t > T k , 

nTk(ipk(f;t,y)) 

= Y j n{a i )E{I {et=a . } e~ J* nidu)du p Stt (y - x)\9 s = a>j) =Tk flj-fa) 



k 



= Y j n{a l )E[I {el =ai} E(e-Io~ a ^ d y ot _ s (y- x )\9l s )] ^-(r fc ) 

j { *=x) 

= ^2n(ai)rji(t -T k ,y- X k )pji(t - T k )iTj{r k ). 
j 

Similar formula holds for the denominator of the first term of the RHS of 
the equation. Now equation (3.8) follows from (3.4). 

Mimicking the previous calculations and using the notation 

r 3l {t) :=E(e-fo n ^ )du \el=a t ), 
it is readily checked that, for t > T k , 

™T h (i>k{'*-{6t=ai}\ t )) =n(a i )^2'Kj(Tk)fji(t-Tk)pji(t -T k ) 

j 

and 

TfTTtW'jfcOM)) =^2^j{T k )n(a i )f j i(t - T k )pji(t - T k ), 
hi 

which are needed in computing (3.5). It is easily verified that, in the setting 
of this example, equation (3.5) reduces to the following: 

diTi(t) = \(a,j , ajjiTj (t) dt 

(5-1) 

+ D(T k ,t)TT i (t)dt + D i (T k ,t)dt, 
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where 

„ , x s n(ai) Ej rji(t - T k )pji{t - T k )-nj(T k 

Di(T k ,t) ^ 



St° Sij- n(ai)rji(s - T k )pji(s - T k )itj(T k ) ds ' 

V, ,7 

D(r k ,t) 



n(ai)rji(t - r k )pji(t - n^jin) 



I^Y,i,j n { a i) r ji{s ~ T k )pji{s - T k )TTj(T k )ds' 

Note that equation (5.1) is considered for a fixed u and t > T k {uj). Therefore, 
Tfc and 7r.(rfc) should be viewed as known quantities. 

Example 5.2 (Poisson arrivals). Let 9 be still the same as in Exam- 
ple 5.1. Suppose that the interarrival times between the observations are 
exponential with constant intensity n(9) = A. In other words, Nt is a Pois- 
son process with constant parameter A. In this case, the volatility process 
9 is independent of Nt. Then, on the interval r k <t < T k+ i, equation (5.1) 
reduces to 

diri(t) = ^ A(aj, ai)xj(t) dt 
(5.2) 3 

~^{^Pji{t ~ T k )TTj(T k ) - 7Tj(*)^ dt. 

On the other hand, owing to the independence of N and 9, it is readily 
checked that on the interval T k <t < T k+ i, 

i 

Therefore, the filtering equation (5.2) is simply the forward Kolmogorov 
equation for 9. 

A similar effect appears also in the following example. 

Example 5.3 (Fixed observation intervals). Assume for simplicity that 
the Markov process 9t is homogeneous. Also assume that r k = kh, where h 
is a fixed time step. Notice that 

(5-3) Q t = Q{k) for any t G [Tfc,Tfc+i[. 

Denote by P(t, x, dy) the transition probability kernel of the process 6t, given 
that 9q = x, and let Tt denote the associated transition operator. 
In accordance with Example 2.2, one can take 



4>(r k ,t) = l and <f> k (dt) = $ {rk+l} (t) dt. 
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Thus, we get 

(5.4) <l> k (f;t,y,0 Tk )=E[f(e t )p Tk , t (y-X k )\a{O Tk }vg(k)], 

(5.5) Mf;t,0 Tk )=T t _ T J(O Tk ):= J f(y)P(t - r k ,6 Tk ,dy). 

Since <b k {dt) = on [[T/cTfc+i [, (3.5) is reduced to the forward Kolmogorov 
equation 

subject to the initial condition vr Tfc (/). The unique solution of this equation 
is given by n t (f) = vr Tfe (T t _ Tk f), t < r k+1 . Hence, 

(5-6) 7T Tk+1 ^f)=7T Tk (T h f). 

Since 4>(T k ,t) = 1, the denominator of M. k is equal to 1 when t = T k +i- This 
together with the formula ^({r k+ i}) = 1 yields 

(5.7) M k (f-An)\t=r k+ M{n+i}) = ^ Tk (T h f)-^ Tk+1 M)- 

Owing to (5.7), we get M k (f;t,TT t )\ t=Tk+1 ^({r k+1 }) = 0. 
This yields the following recursion formula: 



T^+i (/) 



KT k (ipk(f;t,y)) 



TT Tk (ip k (l;t,y)) 



t=T k +i,y=X Th 



k E (f(°t-T k )po,t-r k {y - z)\0 o = z)ir Tk (dz) 



f R E(p 0)t _ Tk (y - z)\e = z)7r Tk (dz) 



t=r k+1 ,y=X- 



fe+i 
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